Corpus: vie_wikipedia_2007_10K

Other corpora

5.2.18 Words nearly always together in sentences

Strong sentence co-occurrences with a low probability of being separated

The quotient below is calculated as freq(word1)*freq(word1)/together_freq^2.

Word 1 Word 1 Frequency of word 1 Frequency of word 2 Frequency together Qoutient
Sài Gòn 33 30 28 1.26
Gòn Sài 30 33 28 1.26
Dựa Thijs 24 21 21 1.14
Thijs Dựa 21 24 21 1.14
khổng lồ 21 21 21 1.00
lồ khổng 21 21 21 1.00
lũng thung 15 12 11 1.49
thuẫn mâu 15 13 13 1.15
mâu thuẫn 13 15 13 1.15
thung lũng 12 15 11 1.49
arrondissement Franche-Comté 7 5 5 1.40
arrondissement canton 7 7 7 1.00
arrondissement département 7 7 7 1.00
canton Franche-Comté 7 5 5 1.40
canton département 7 7 7 1.00
canton arrondissement 7 7 7 1.00
département Franche-Comté 7 5 5 1.40
département canton 7 7 7 1.00
département arrondissement 7 7 7 1.00
láng giềng 7 5 5 1.40
345 msec needed at 2018-01-27 11:05